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Abstract 

Multimode preceding, where the number of independent data-streams is adapted optimally, can be 
used to maximize the achievable throughput in multi-antenna communication systems. Motivated by 
standardization efforts embraced by the industry, the focus of this work is on systematic precoder design 
with realistic assumptions on the spatial correlation, channel state information (CSI) at the transmitter and 
the receiver, and implementation complexity. For spatial correlation of the channel matrix, we assume a 
general channel model, based on physical principles, that has been verified by many recent measurement 
campaigns. We also assume a coherent receiver and knowledge of the spatial statistics at the transmitter 
along with the presence of an ideal, low-rate feedback link from the receiver to the transmitter. The 
reverse link is used for codebook-index feedback and the goal of this work is to construct precoder 
codebooks, adaptable in response to the statistical information, such that the achievable throughput 
is significantly enhanced over that of a fixed, non-adaptive, i.i.d. codebook design. We illustrate how 
a codebook of semiunitary precoder matrices localized around some fixed center on the Grassmann 
manifold can be skewed in response to the spatial correlation via low-complexity maps that can rotate 
and scale submanifolds on the Grassmann manifold. The skewed codebook in combination with a low- 
complexity statistical power allocation scheme is then shown to bridge the gap in performance between 
a perfect CSI benchmark and an i.i.d. codebook design. 
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I. Introduction 

Research over the last decade has firmly established the utility of multiple antennas at the transmitter 
and the receiver in providing a mechanism to increase the reliability of signal reception [1], or the rate 
of information transfer [2], or a combination of the two. The focus of this work is on maximizing the 
achievable rate under certain communication models that are motivated by wireless systems in practice. In 
particular, we assume a limited (or quantized) feedback model [3] with perfect channel state information 
(CSI) at the receiver, perfect statistical knowledge of the channel at the transmitter, and a low-rate 
feedback link from the receiver to the transmitter. In this setting, the fundamental problem is to determine 
the optimal signaling/feedback scheme that maximizes the average mutual information given a statistical 
description of the channel, signal-to-noise ratio (SNR), the number of antennas, and the quality of limited 
feedback. 

A low-complexity approach to solving this problem is to first determine the rank of the optimal precoder 
as a function of the statistics, SNR, and the quality of feedback. The design of the optimal scheme is 
then, in principle, essentially the same as that of a fixed rank limited feedback precoder whose rank is 
adapted optimally. Motivated by this line of reasoning, the main theme of this work is the construction 
of a systematic, yet low-complexity, limited feedbackprecoding scheme (of a fixed rank) that results in 
significantly improved performance over an open-looju scheme. Towards this goal, we consider a simple 
block fading/narrowband setup where spatial correlation is modeled by a mathematically tractable channel 
decomposition [4]-[6], and includes as special cases the well-studied i.i.d. model [2], the separable 
correlation model [7], and the virtual representation framework [8], [9]. Furthermore, we also assume 
that the power-constrained input signals come from some discrete constellation set whereas the decoder 
is assumed to have a simple, linear architecture hke the minimum mean-squared error (MMSE) receiver. 

While precoding has been studied extensively under the i.i.d. model [10]-[18], considerable theoretical 
gaps exist in the limited feedback setting. The extreme case of limited feedback beamforming has been 
studied in the i.i.d. setting where the isotropicitj^ of the dominant right singular vector of the channel 
can be leveraged to uniformly quantize the space of unit-normed beamforming vectors, a problem 
well-studied in mathematics literature as the Grassmannian line packing (GLP) problem [19], [20]. 

'There is no correlation information at the transmitter in an open-loop scheme. That is, the channel is assumed to be i.i.d. 
and an i.i.d. codebook design is used. 

^Isotropic means that the dominant right singular vector is equally likely to point along any direction in the ambient transmit 
space. This ambient space of all possible right singular vector(s) is referred to as the Grassmann manifold. Precise definitions 
are provided later in the paper. 
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Alternate constructions based on Vector Quantization (VQ)/Random Vector Quantization (RVQ) are 
also possible [21], [22]. Spatial correlation, however, skews the isotropicity of the right singular vector, 
and hence poses a fundamentally more challenging problem. While VQ codebooks can be constructed 
for the correlated channel case, the construction suffers from high computational complexity and the 
codebook has to be reconstructed from scratch every time the statistics change, thus rendering VQ-type 
solutions impractical. Recently, beamforming codebooks that can be easily adapted to statistical variation 
(with low-complexity transformations) have been proposed [23]-[25]. The other extreme, limited feedback 
spatial multiplexing, has also been studied [26], [27]. 

In the intermediate settingjof rank-Af precoding, under the i.i.d. assumption, the isotropicity property 
of the dominant right singular vector of the channel extends to the subspace spanned by the M-dominant 
right singular vectors thereby allowing a Grassmannian subspace packing solution [28]. In the correlated 
case, the fundamental challenge on how to non-uniformly quantize the space of M -dominant right singular 
vectors remains the same as in the beamforming case. However, unlike the beamforming case, it is not 
even clear how a codebook designed for i.i.d. channels can be skewed in response to the correlation. 
In fact, using an i.i.d. codebook design in a correlated channel can lead to a dramatic degradation in 
performance (see Figs. [3] and |4l). 

Our main goal here is to construct a systematic semiunitarjQ precoder codebook that is tailored to the 
spatial correlation, and is easily adaptable in response to a change in statistics. The heuristic behind our 
construction comes from our previous study of the asymptotic performance of the statistical precoder [29]. 
We showed in [29] that the performance of the statistical precoder is closest to the optimal precoder 
when the number of dominant transmit eigenvalues is equal to the rank of the precoder, these dominant 
eigenvalues are well-conditioned, and the receive covariance matrix is also well-conditioned. A channel 
satisfying the above conditioning properties is said to be matched to the communication scheme. Thus, 
while limited (or even perfect) feedback can only lead to marginal performance improvement in matched 
channels, in the case of mismatched channels where the relative gap in performance between the statistical 
and the optimal precoders is usually large, the potential benefits of limited feedback are more significant. 

This study [29] suggests that spatial correlation orients the directivity of the M -dominant right singular 
vectors of the channel towards the statistically dominant subspaces and hence, a non-uniform quantization 
of the local neighborhood around the statistically dominant subspaces is necessary. The realizability of 
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Here, 1 < M < min(A'^t, Nr) with Nt and A'^,. denoting tlie transmit and the receive antenna dimensions. 
An Nt X M matrix X with M < Nt is said to be semiunitary if it satisfies X^X = Im- 
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such a non-uniform quantization with low-complexity, as well as its adaptability, are eased by mathe- 
matical maps that can rotate a root codeset (or a submanifold) centered at some arbitrary location on the 
Grassmann manifold Q{Nt, M) towards an arbitrary center and scale it arbitrarily. 

Our design includes a statistical component of dominant M-dimensional subspaces of the transmit 
covariance matrix, a component corresponding to local quantization around the statistical component, 
and an RVQ component which can be constructed with low-complexity. In this context, our construction 
mirrors and generalizes our recent work in the beamforming case [25]. By combining a semiunitary 
codebook (of a small enough cardinality) with a low-complexity power allocation scheme that is related 
to statistical waterfilling, we show via numerical studies that significant performance gains can be achieved 
and the gap to the perfect CSI scheme can be bridged considerably. 

Organization: The system setup is introduced in Section |lll In Section |llll we introduce the notion of 
mismatched channels where limited feedback preceding results in significant performance improvement. 
In Section |IVl limited feedback codebooks that enhance performance are proposed and in Section |Vl 
mathematical maps are constructed to realize these designs with low-complexity. Numerical studies are 
provided in Section |Vl] with a discussion of our results and conclusions in Section IVIII 
Notation: The M-dimensional identity matrix is denoted by I a/- We use X(i,j) and X(i) to denote 
the j-th and z-th diagonal entries of a matrix X. In more complicated settings {e.g., when the matrix 
X is represented as a product or sum of many matrices), we use Xj to denote the i,j-th entry. The 
complex conjugate, conjugate transpose, regular transpose and inverse operations are denoted by (•)*, 
(•)^, (•)^ and (•)"^ while E[-], Tr(-) and det(-) stand for the expectation, the trace and the determinant 
operators, respectively. The i-dimensional complex vector space is denoted by C*. We use the ordering 
Ai(X) > • • • > A„(X) for the eigenvalues of an n x n-dimensional Hermitian matrix X. The notations 
Aniax(X) and Amin(X) also stand for Ai(X) and A„(X), respectively. 

II. System Setup 

We consider a communication model with Nt transmit and A^^^. receive antennas where M (1 < M < 
min(A'^t, Nr)) independent data-streams are used in signaling. That is, the M-dimensional input vector s 
is precoded into an A'^t-dimensional vector via the Nt x M preceding matrix F and transmitted over the 
channel. The discrete-time baseband signal model used is 

y = HFs + n (1) 

where y is the A'^ -dimensional received vector, H is the Nr x Nt channel matrix, and n is the Nr- 
dimensional zero mean, unit variance additive white Gaussian noise. 
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A. Channel Model 

We assume a block fading, narrowband model for the correlation of the channel in time and frequency. 
The main emphasis in this work is on channel correlation in the spatial (antennas) domain. The spatial 
statistics of H depend on the operating frequency, physical propagation environment which controls the 
angular spreading function and the path distribution, antenna geometry (arrangement and spacing) etc. It 
is well-known that Rayleigh fading (zero mean complex Gaussian) is an accurate model for H in a non 
line-of-sight setting, and hence the complete spatial statistics are described by the second-order moments. 

The most general, mathematically tractable spatial correlation model is a canonical decompositio^ 
of the channel along the transmit and the receive covariance bases [4]-[6]. In the canonical model, we 
assume that the auto- and the cross-correlation matrices on both the transmitter and the receiver sides 
have the same eigen-bases, and therefore we can decompose H as 

H = U,HindUf (2) 

where Hind has independent, but not necessarily identically distributed entries, and Uj and Ur are unitary 
matrices. The transmit and the receive covariance matrices are given by 

^t = E [H^H] = VtE [H^dHind] Uf = U^A^Uf 

^r = E [HH^] = VrE [HindH^fd] Uf = A, (3) 

where At = E [Hf ^Hind] and A,. = E [HindHf are diagonal. Under certain special cases, the model 
in ([2]) reduces to some well-known spatial correlation models [4]: 

• The case of ideal channel modeling assumes that the entries of Hind are i.i.d. standard complex 
Gaussian random variables [2]. The i.i.d. model corresponds to an extreme where the channel is 
characterized by a single independent parameter, the common variance. 

1 1/2 1/2 

• When Hind is assumed to have the form • A^ HiidA, with Hiid an i.i.d. channel matrix 
and the channel power pc = Tr(At) = Tr(A,.), the canonical model reduces to the often-studied 
normalized separable correlation framework where the correlation of channel entries is in the form 
of a Kronecker product of the transmit and the receive covariance matrices [7]. The separable model 
is described by no more than Nt + Nr independent parameters corresponding to the eigenvalues 
{Atii)} and {Ar{i)}. 



^This model is referred to as the "eigenbeam/beamspace model" in [5] and is used in capacity analysis in [6]. 
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• When uniform linear arrays (ULAs) of antennas are used at the transmitter and the receiver, JJt and 
JJr are well-approximated by discrete Fourier transform (DFT) matrices and the canonical model 
reduces to the virtual representation framework [8], [9], [30]. In contrast to the general model in Q, 
the virtual representation offers many attractive properties: a) The matrices Uf and are fixed and 
independent of the underlying scattering environment and the spatial eigenfunctions are beams in 
the virtual directions. Thus, the virtual representation is physically more intuitive than the general 
model in ([2]), b) It is only necessary that the entries of Hind be independent, but not necessarily 
Gaussian, a criterion important as antenna dimensions increase, and c) The case of specular (or 
line-of-sight) scattering can be easily incorporated with the virtual representation framework [30]. 
In contrast to the separable model, the virtual representation can support up to NtNr independent 
parameters corresponding to the variances of {Hjnci(^, j)}- 
While performance analysis is tractable in the i.i.d. case, it is unrealistic for applications where large 
antenna spacings or a rich scattering environment are not possible. Even though the separable model 
may be an accurate fit under certain channel conditions [31], deficiencies acquired by the separability 
property result in misleading estimates of system performance [4], [32], [33]. The readers are referred 
to [5], [32], [34] for more details on how the canonical/virtual models fit measured data better. 

B. Channel State Information 

If the fading is sufficiently slow, perfect CSI at the receiver is a reasonable assumption for practical 
communication architectures that use a "training followed by signaling" model. Even in scenarios where 
this may not be true {e.g., a highly mobile setting), the performance with imperfect CSI at the receiver can 
be approximated reasonably accurately by the perfect CSI case along with an SNR-offset corresponding 
to channel estimation. Thus in this work, we will assume a perfect CSI (coherent) receiver architecture. 
However, obtaining perfect CSI at the transmitter is usually difficult due to the high cost associated with 
channel feedback/reverse-link training^. 

On the other hand, the statistics of the fading process change over much longer time-scales and can be 
learned reliably at both the ends. In addition, recent technological advances have enabled the possibility 
of a few bits of quantized channel information to be fed back from the receiver to the transmitter at 

*In case of Time-Division Duplexed (TDD) systems, the reciprocity of tiie forward and the reverse hnks can be exploited to 
train the channel on the reverse link. In case of Frequency-Division Duplexed (FDD) systems, the chaimel information acquired 
at the receiver has to be fed back. 
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regular intervals. The most common form of quantized channel information is via a limited feedback 
codebook C of 2^ codewords known at both the ends. In this setup, the receiver estimates the channel at 
the start of a coherence block and computes the index of the optimal codeword from the codebook C for 
that realization of the channel according to some optimality criterion. It then feeds back the index of the 
optimal codeword with B bits over the Umited feedback link which is assumed to have negligible delay 
and essentially no errors (since B is usually small). The transmitter exploits this information to convey 
useful data over the remaining symbols in the coherence block. 

C. Transceiver Architecture 

The transmitted vector Fs (see ^) has a power constraint p. Assuming that the input symbols s(A;) 
have equal energy the precoder matrix satisfies Tr(F^F) < M. Non-linear maximum likelihood (ML) 
decoding of the transmitted data symbols using knowledge of H at the receiver is optimal. However, ML 
decoding suffers from exponential complexity, in both antenna dimensions and coherence length. Thus 
in practice, a simple linear receiver architecture like the MMSE receiver is preferred. With this receiver, 
the symbol corresponding to the k-th data-stream is recovered by projecting the received signal y on to 
the A^r X 1 vector 

(^HFF^H^ + I^^) Hffc (4) 

where is the A;-th column of F. That is, the recovered symbol is s{k) = ^J^gj^y- The signal-to- 
interference-noise ratio (SINR) at the output of the linear filter is 

_P_|„_ffTTf I 2 1 

SINRfc = , MjSk^ i 

gf {ii Hf.ff H^^ + In.) gk (Im + ^F^H^HF)-;, 

where the second equality follows from the Matrix Inversion Lemma. 

The outputs s{k) are passed to the decoder and we assume separate encoders/decoders for each data- 
stream, as well as independent interleavers and de-interleavers, which reduces the correlation among the 
interference terms at the outputs of the receiver filters. The performance measure is the mutual information 
between s and s. Assuming that the interference plus noise at the output of the linear filter has a Gaussian 
distribution, which is true with Gaussian inputs and is a good approximation in the non-Gaussian setting 
when {M, Nf, Nr} are large, the mutual information is given by 

/(s;s) = 5^1og2 (1 + SINRfc) = -^log2 f (ijv/ + ^F^H^Hf)^ J . (6) 
fc=i fc=i ' 
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When perfect CSI is available at the transmitter and no constraints are imposed on the structure of the 

~ 1/2 

precoder, the optimal precoder Fperf is channel diagonalizing and is of the form Fperf = VhAJj: where 
VhAhV^ is an eigen-decomposition of H^H with the eigenvalues arranged in non-increasing order, 
Vh is the Nt x M principal submatrix of Vh, and A^f is an M x M matrix with non-negative entries 
only along the leading diagonal and these entries are obtained by waterfilling. In this setting, the mutual 
information is given by 

M 

Iperf(s;s) = J^log2 (l + ^AH(A:)Awf(A:)) . (7) 

k=l 

The optimality of Fperf with other choices of objective functions is also known; see [10]-[18]. 

D. Limited Feedback Framework 

The focus of this work is on understanding the implications of partial CSI at the transmitter on the 
performance of the precoding scheme. In particular, there exists a codebook of the form C = {Fj,i = 
1, • • • , 2^} where Fj is an Nt x M precoder matrix with Tr(F^Fj) < M. The most general structure 

1/2 

for Fj is Fj = VjA/ where Vj is an Nt x M semiunitary matrix and Aj is an M x M non-negative 
definite, diagonal power allocation matrix. While the structure of the optimal limited feedback codebook 
of B bits could involve allocating some fraction of B to the power allocation component of Fj, numerical 
studies indicate that the degradation in performance is minimal when Aj is chosen to be fixed (say, Agtat 
with Tr(Astat) < M), but designed appropriately, as a function of SNR if necessary, so that it can be 
easily adapted to statistical variations without recourse to Monte Carlo methodj^ 

Motivated by this heuristic, in this work, all the B bits in limited feedback are allocated to quantize 
the eigenspace of the channel. That is, the codebook is C = {Vj : Vf^Vj = 1^} and the index of the 
codeword that is fed back is 

f = argmax |-^log2 ((Im + -^A,Y>f VHAnVgV.A.Yat) ^ 'J | • (8) 

Although computing j* is straightforward, the design of an optimal codebook to maximize /(s; s) seems 
difficult. Here, we adopt a suboptimal strategy where the goal is to maximize the average projection of 
the best codeword from C onto Vh. Towards the precise mathematical formulation of this problem, we 
need a metric to define distance between two semiunitary matrices. 



^The design of Astat will be dealt with in Sec. IIVI 
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E. Distance Metrics and Spherical Caps on the Grassmann Manifold 

We now recall some well-known facts about the Grassmann manifold. The unit sphere in C^', 
also known as the uni-dimensionaj^ complex Stiefel manifold St(A'^t,l), is defined as St(A^t,l) = 
{x e C^* : ||x|| = l}. The invariance of any vector x to transformations of the form x i-^ e-^'^x in 
the above definition is incorporated by considering vectors modulo the above map. The partitioning of 
St(A'^t, 1) by this equivalence map results in the uni-dimensional Grassmann manifold G{Nt, 1). In short, 
the Grassmann manifold corresponds to a linear subspace in an Euclidean space. Similarly, the class of 
Nt X M semiunitary matrices forms the M-dimensional complex Stiefel manifold St(A'^t, M) and points 
on the Af -dimensional complex Grassmann manifold Q{Nt,M) are identified modulo the M-dimensional 
unitary space. 

A literature survey of packings on Q{Nt, 1) [35]-[37] shows that many distance metrics are equivalent 
to the dot product metric which is the most natural metric from an engineering perspective. The dot 
product metric is defined as (i(xi,X2) = — |xi^X2p. Using this distance metric, for any 7 < 1, 
we can define a spherical cap with center o and radius 7 (as a submanifold on G{Nt, 1)) as the open 
set 0(0,7) = {x G G{Nt, 1) : d(x, o) < 7} . A spherical cap on G{Nt, 1) induces a spherical cap on 
St(A'^t, 1) via the equivalence partitioning generated by the map x 1-^ e-^'^x. 

In the more general M > 1 case, there is no unique distance metric extension. While various well- 
defined distance metrics can be pursued, we will focus on the projection 2-norm distance metric [36]. 
Here, the distance between two Nt x M semiunitary matrices Vi and V2 is defined as 

dproj,2(Vi, V2) = Aniax (ViVf - V2Vf ) . (9) 

A particular choice of the distance metric is not extremely critical in precoder optimization since code- 
books designed with different choices of distance metrics result in near-identical performance [28], [29]. 
In addition to this fact, the following lemma shows that the projection 2-norm metric is attractive by 
being a natural generalization of the dot product metric. 

Lemma 1: In the M = 1 case, the projection 2-norm metric reduces to the standard dot product metric. 
Proof: Let vi and V2 be two unit-normed A^^ x 1 vectors. Then, the projection 2-norm distance 
between vi and V2 is defined as (iproj,2(vi, V2) = Amax (viv^ — V2v|^). We can write the matrix within 
the Amax(') operation as [vi V2] [vi — ^2]^ ■ Since the non-trivial eigenvalues of a matrix product AB 

^Uni-dimensional because its definition is based on tlie norm of an A^t x 1 vector. 
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are the same as those of BA, we need the largest eigenvalue of 



X 



Vi V2 



1 

-vfvi 



vf V2 

-1 



(10) 



Expanding the characteristic equation of X, det(X — AI2) = 0, we have = 1 — |vj^V2p. Using the 
positive root for Amax, the lemma follows immediately. ■ 
Proposition 1: We now state some properties of the projection 2-norm metric: 

1) 0<dproj,2(Vi,V2) < 1, 

2) More precisely, dproj, 2(Vi, V2) = ^Jl- Amin(Vf VaV^^Vi), and 

3) Equality in the lower bound of 1) occurs if and only if Vi = V2 on Q{Nt,M) while equality is 
possible in the upper bound if and only if Ajnin(V^V2V|^Vi) = 0. 

Proof: The proof is provided in three parts. 
1) Using the fact that ViVf^ — ^2^2 Hermitian and its trace equals zero, we see that Aniax(ViVf^ — 
^2^2^) < is impossible. For the upper bound, note that 



Amax(ViVf - V2V^) < A„,ax(ViVf ) = A^ax(Vf Vi) = 1. (11) 

2) We the need the following result [38] that helps in computing the determinant of partitioned matrices. 



Lemma 2: If X, Y, Z and W are n x n matrices and W is invertible, we have 

= det(X- YW^^Z) -detCW). (12) 

■ 

Using the above fact and the trick (in LemmaH]) of rewriting the eigenvalues of AB in terms of eigenvalues 
of BA, 2) follows trivially. 

3) If dproj, 2(Vi, V2) = 0, then it is easy to see that ViV^ = V2V|^ from which we note that 
Vi = V2V^Vi. Observe that Vf Vi is M x M and unitary, and hence, Vi = V2 on g{Nt, M). The 
other direction of the statement follows trivially. Both the directions of the upper bound follow from the 
expression in 2). ■ 
The trick in proving Lemma [T] and statement 2) in Prop. [U is useful and will be used again in the 
construction of the scaling map (see Appendix |B]|. Once a choice of distance metric has been settled, the 
definition of a spherical cap with center O and radius 7 (as a submanifold on Q{Nt,M)) follows naturally 
as the open set 0(0,7) = ^ Q{Nt,M) : (iproj,2 (X;0) < 7} • The codebook design problem can 



det 



X Y 
Z W 
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now be simply stated as: 



mill dpi.oj,2(Vj, Vh] 



Construct C = {Vi,i = !,••• ,2-^} s.t. En 
We now work towards a systematic codebook construction for this problem. 



is minimized. 



III. Matched Versus Mismatched Channels 

The case of unstructured precoding with genie-aided perfect CSI was summarized in Sec. III-CI which 
resulted in Fperf = VhA^'[^. The construction of Vh, as well as A^f, necessitates the tracking of the 
channel evolution which is difficult. To avoid this problem and to reduce the complexity of precoding, 
the following structured precoding was introduced in [29]. 

1/2 

• When the precoder is assumed to be structured as F = VAs^t with V an A'^t x M semiunitary 
matrix, and Astat an M x M fixed, rank-Af power allocation matrix, the optimal choice of V under 
perfect CSI is Vh- This optimality is assured for many different classes of objective functions apart 
from the case of maximizing mutual information. When only statistical information is available at 
the transmitter, the optimal choice of V is Vstat where Vstat is a set of M dominant eigenvectors 
of Ylt, the transmit covariance matrix. We call these two schemes optimal and statistical structured 
precoding schemes, respectively. 

• We study the performance loss between these two schemes as a function of the channel statistics. 
When one antenna dimension grows to infinity at a rate faster than the otheiQ which we refer to as 
the relative antenna asymptotics case, channel hardening leads to convergence of the right singular 
values of the channel to the eigenvalues of and hence, ensures that the statistical scheme performs 
near-optimally. This conclusion generalizes prior results in the beamforming case where statistical 
beamforming is shown to be near-optimal in the relative antenna asymptotics setting [25]. 

• Further, for any reasonably large (but fixed) value of antenna dimensions, the relative performance 
loss between the two schemes is minimized by the following choice of statistics: 1) The set of transmit 
eigenvalues {Af(i)} can be partitioned into two components: a well-conditioned component of M 
dominant eigenvalues, and the remaining Nt — M transmit eigenvalues are ill-conditioned away from 
the dominant set, and 2) The set of receive eigenvalues {A.j.{i)} are well-conditioned. In particular, 
if Tr(St) = Tr(5]r) = NtNr, the structure of A^ and A,, that minimizes performance loss is 
At{l) = ■■■ = At{M) = i^,AtiM + l) = ■■■ = At{Nt) = 0, and A,(l) = • • • = A,(iV,) = Nt. 

'That is, when ^ — > or cx3 as {Nt, Nr} oo. 
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Such a channel is said to be matched to the precoding scheme. On the other extreme, statistical 
structured precoding in an i.i.d. channel leads to very high performance loss when compared with 
the optimal scheme. Thus, an i.i.d. channel is mismatched to the precoding scheme. More important 
to note is that any feedback (limited or otherwise) is helpful only in mismatched chaimels and only 
when the transmit and the receive dimensions are proportionate. This conclusion is a generalization 
of our earlier beamforming result [25]. 

The readers are referred to [29] for details. Henceforth, the focus will be on mismatched channels 
primarily because the potential to bridge the performance gap between the statistical and perfect CSI 
schemes is maximum. Our goal is to construct a systematic, statistics-dependent codebook (of a fixed 
size 2^) that ensures this bridging. 

IV. Quantized Feedback Designs to Bridge the Performance Gap 

In contrast to the i.i.d. case where the isotropicity of the right singular subspace of the chaimel leads to 
a design [28] based on Grrassmaimian subspace packings [37], spatial correlation skews this isotropicity 
and poses fundamental challenges. The study of statistical precoding motivates the following heuristic 
in the correlated case. While the asymptotic chaimel hardening (and the consequent near-optimaUty of 
statistical precoding) does not carry over when Nt and A^^ are small or when they are proportionate, it is 
expected that the distance between Vstat and Vh is small on average. Thus, when we have the freedom 
to pick more than one codeword (B > 0), the codewords should correspond to a "local quantization" of 
Vstat- The notion of local quantization will be made precise shortly. 

We now describe the codebook design for limited feedback precoding. Our design is a multi-mode 
generalization of the beamforming codebook proposed in [25], [39]. The differences between the two 
schemes lie in packing subspaces, rather than Unes, and in the choice of an appropriate distance met- 
ric. For this, we introduce the notion of generalized eigenvalues of subspaces of Sj. Consider the 
family of subspaces spaimed by M distinct eigenvectors of Sj. Note that there are (^) members 
in this family. For each such subspace, we associate a generalized eigenvalue defined as the M-fold 
product of the corresponding transmit eigenvalues. For example, if A'^t = 4 and M = 2 with the 
columns of Ut denoted by Ui,i = I,-- - ,4, the six subspaces correspond to the 4x2 matrices: 
[ui U2], [ui U3], [ui U4], [u2 U3], [u2 U4] and [U3 U4]. The generalized eigenvalue corresponding to [ui U2] 
is At(l)At(2) etc. Note that among all the M-dimensional subspaces of Xlj, the subspace spaimed by 
Vstat results in the largest generalized eigenvalue. 
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The proposed codebook design has three components: 1) a statistical component, 2) local perturbation 
components, and 3) an RVQ component. The cardinalities of these components are denoted by A'stat, ^loc 
and A'^rvq with the feedback rate defined hy B = log2(-^stat + -^loc + -^rvq)- 

Statistical Component: While the distance between Vstat and Vh, an instantaneous realization of the M- 
dominant right singular vectors of the channel is expected to be small on average, the precise probability 
distribution of this distance is determined by the separation (gap) between the generalized eigenvalues of 
Sf. For example, if the first two dominant generalized eigenvalues are close to each other, there is a non- 
negligible probability for the event that the best quantizer is the subspace whose generalized eigenvalue is 
the smaller of the two and hence, the distance between Vstat and the optimal precoder could be arbitrarily 
closed to 1. On the other hand, if the largest generalized eigenvalue of is much larger than the other 
generalized eigenvalues, the probability distribution of this distance is concentrated around zero. Thus 
the gap between the largest generalized eigenvalue and the other generalized eigenvalues heuristically 
determines the cardinality of the statistical component, iVstat- In our design, a threshold (3 is chosen 
a priori for the generalized eigenvalues and the statistical component consists of all M-dimensional 
subspaces such that their generalized eigenvalue exceeds the threshold. That is, the statistical component 
is the set 5 = |z : ^ > /?! where /Uj are the M-fold generalized eigenvalues of T^t and ixi is the largest 
generalized eigenvalue. The cardinality of S is A'^jtat- 




Fig. 1. Proposed Codebook Design for Nt = 3,M = 2, and 5 = 3 witli only the statistical and local components. 

Local Components: For the i-th member of the statistical component, we construct A'^i^^^ codewords so 

'"Note from Prop. [T] that the distance between the first two dominant eigen-spaces of St is 1. This is because 
Amin(Vf^V2V|'^Vi) — where Vi and V2 denote the first two dominant eigen-spaces. 
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that they are localized and well-packed around the corresponding statistical codeword. While these local 
codewords can theoretically be designed via VQ, we provide low-complexity alternatives in Sec. IVl where 
we also elaborate on the notions of localized and well-packed. The choice of N^-^^ is in proportion to 
the generalized eigenvalue of the subspace. The heuristic behind this choice is as follows: The larger the 
separation of the generalized eigenvalue fii (corresponding to Vjtat) from the next largest generalized 
eigenvalue or the more matched 5]^ is, the lesser the relevance of the less-dominant subspaces in terms 
of preceding and hence, the smaller the values of {A'^^oc}'^ > ^ need to be. These A'^^oc = Si^T -^loc 
codewords form the local component of our codebook design. 

In Fig. \T\ we illustrate the design of a codebook with statistical and local components where Nt = 
3,M = 2, iVstat = 3, iV|^^ = iV|^^ = 2 and iV,^^ = 1. If JJt = [U1U2U3], then the three statistical 
transmit eigenspaces with M = 2 are those spanned by [U1U2], [U1U3] and [U2U3]. The "directions" 
corresponding to these subspaces are symbolically represented in the figure with dashed lines. The first 
local component consists of two codewords around [ui U2] and so on. Since there are eight codewords 
in our design, this codebook can be parameterized with B = 3 bits. 

RVQ Component: If B is sufficiently large, there is a need to refine the quantization of Vh- In this 
setting, A'^rvq = 2^ — A'stat — N]oc random channel matrices are generated according to the relationship 
in ^ and their M-dominant right singular vectors are used as the semiunitary precoder codewords in the 
RVQ component. Note that the RVQ component can be generated with low-complexity once the statistics 
are known perfectly. 

A. Power Allocation 

It is preferred that the power allocation matrix Astat be only dependent on the channel statistics 
and be easily adaptable to statistical variations. The optimal choice of Astat needs to be constructed 
via a Monte Carlo algorithm which is difficult to implement as well as adapt to statistical variations 
with low-complexity. As an alternative, we consider three low-complexity power allocations: 1) uniform 
power allocation across the excited modes, 2) waterfiUing based on At{i), i = 1, ■ ■ ■ , M, and 3) power 
allocation proportional to the transmit eigenvalues. The last two schemes have near-identical performances 
and are near-optimal in the low-SNR regime while uniform power allocation is more useful in the high- 
SNR regime. 
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B. Codeword Selection 

The receiver acquires the channel information at the start of a coherence block and it computes the 
index of the optimal codeword from the codebook that maximizes the instantaneous mutual information. 
The receiver then communicates to the transmitter the index of the optimal codeword with B bits. The 
transmitter uses the optimal codeword along with an appropriate power allocation to communicate over 
the remaining period in the coherence block. 




(b) 

Fig. 2. (a) Rotation of a root codeset of semiunitary precoders {V;, i = 1, ■ ■ ■ , 5} with Nt = 3 and M — 2. Tlie root codeset 
satisfies tfie localization and well-packing properties described in Sec. |V] The distance between any two precoders remains 
unchanged after rotation, (b) Scaling of the root codeset by a. The position of Vi remains unchanged after scaling. 

V. Rotating and Scaling Spherical Caps on Q{Nt, M) 

We now propose mathematical maps to ensure that the codebook design proposed above can be realized 
with low-complexity. For this, we need the notion of a root codeset. Let TZ = {Vj, i = !,•••, A^} be a 
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root codeseO of N semiunitary matrices satisfying the following properties which are characteristic of 
a 'good' local quantization: 

1) Localization: The root codeset is localized (centered) around Vi. That is, there exists a. 9 £ {0, 1) 
such that (iproj,2(Vi, Vj) < 9 for all i I. The smaller the value of 9, the more localized a 
packing. We often label Vi as the center of the root codeset. This is illustrated in Fig. |2] where a 
set of = 5 precoders form the localized root codeset in the Nt = M = 2 setting. 

2) Well-Packing: The codewords in TZ are well-packed (well-separated). That is, given some 7 S 
{'^■,lma.x{Nt,M,N,9)), the minimum distance of the packing dmin(7^) defined as dmin(7^) — 
minj^j dproj, 2(Vi, Vj) is larger than 7. The larger the value of 7, the well-packed TZ is. Hence 7 
can also be viewed as a measure of the packing density. Here, 7max(-^ti M, N,9) is the maximum 
possible packing densitjo achievable in the Grassmann manifold Q {Nt , M) with codewords 
localized in a cap of radius 9. 

Note that for any fixed choice of Nt,M and N, it is intuitive to expect that 7max(-^ti M, N, 9) decreases 
as 9 decreases. In other words, the above two properties aie in some sense conflicting with a root codeset 
that is more localized necessarily forced to have a small packing density and vice versa. 

Despite this apparent difficulty, it is important to note that a packing with the above properties can 
always be constructed, either via algebraic methods or via a vector quantization [21], [22] approach 
(that is, a brute force search via Monte Carlo-type algorithms). Furthermore, TZ needs to be constructed 
(offline) just once, and once this has been done, C can be designed for any statistics starting from IZ. For 
this, we now show how mathematical operations can be constructed to perform the following two tasks: 

1) Given a root codeset TZ oi N codewords with a packing density 7 and a target center Vtarget> how 
can we center TZ around Vtarget without having to resort to a VQ-type codebook construction againl 
That is, we seek a map to rotate the center of TZ to Vtarget without changing the packing density, and 

2) Given a root codeset TZ centered around Vi with a packing density of 7 and some fixed a G (0, 1), 
how can we scale TZ so that the packing density of the resultant codeset is 0:7? That is, we seek a map 
to reduce the minimum distance of TZ without changing its center. 

While we develop such maps for spherical caps/submanifolds, we will state the results as applicable 
to finite element subsets of Q{Nt, M). But prior to that, we recall results from a recent work [40] where 

"We use the term root codeset to indicate tliat tiie construction of C is rooted in the design of a 'good' TZ. 
'"While the exact characterization of "ymcixiNt, M, N,6) remains an open problem for general values of Nt,M, N and d, 
some bounds have been established; see [19], [24], [36], [37] and references therein. 
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rotation and scaling maps to solve 1) and 2) (as above) have been proposed in the beamforming case 
(M = 1). The rotation map is straightforward and is effected by an appropriately chosen unitary matrix. 
In contrast to the rotation operation, the scaling map requires some care due to the constraints of the 
space. For example, an operation of the form x i-^ ax where a e M yields a vector that is not unit-norm. 
It is to be noted that both rotation and scaling maps are non-unique. We summarize the map of [40] in 
the following lemma for M = 1. 

Lemma 3 (See [40]): Let TZ = {vj,i = ,A^} be a root codeset in ^(A^^,!) with a packing 

density 7 and center vi. The map that effects the rotation of vi to Vtarget is given by r(vj) = UtargetVj 
with Utarget Satisfying^ Vtarget = UtargetVi. For scaling by a, we first define a rotation map ryertex 
generated by a unitary matrix Uyertex that effects the rotation of the center vi to Vvertex = [1, 0, • • • ,0]^, 
a vertex of the unit cube. Then, define a vertex scaling map Svertex : ©(vvertexiT) ^ ©(vvertex, 07) by 



T 

... r-.r^i^N.^T^ 



(13) 



where we have denoted the vector in the argument on the left side of the above equation in its polar 
form. The map Sbf (•) defined as a composition Sbf = r^^rte^^ ° -^vertex ° ''vertex results in 



Sbf(vO = vi^l - a2(l - |vf Vi|2)e^^^"?-' + «vf v^'^v,. (14) 

It can be checked that Sbf(vi) = vi on Q{Nt, 1). Furthermore, the inner product of the second term with 
vi is zero. Hence, d (sbf (vj), Sbf (vi)) = d (sbf (vi), vi) = a(i(vj,vi) for all i. ■ 
The rotation and scaling maps to be proposed now generalize the result of [40] to the precoding 
scenario, M > 1. 

Theorem 1: Let TZ = {Vj, i = 1, - ■ ■ , N} be a root codeset centered around Vi with a packing density 
7. Let the NtxM semiunitary matrix Vtarget be the desired center of the rotated codeset. Then, the rotated 
codeset Q is given by ^ = {Gj, i = 1, • • • , N} where Gj = Uvt,,^^, Vj with unitary matrices Uvi 
and Uv..,,.. defined as Uv, = [Vi V^"] and Uv...... = [V^arget V-'^.^]. Here, V^" and V-'^.^ are 

Nt X [Nt — M) -dimensional representatives of the null-spaces of Vi and Vtarget> respectively. 

Proof: See Appendix [Al ■ 

''The readers are referred to [25] for details of the proof. 

'''One possible choice of Utarget is Utarget = [vtarget Vtl^rget] |^ Vi Vi j where vf^rget and Vi" refer to matrix 

representatives from the Nt x [Nt — 1) dimensional null-space of Vtarget and vi, respectively. That is, Vj'^'^Vi" = IjVt-i 
and Vj^' ^vi = OjVt-ixi- 
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Note that there exists more than one basis for the null-space and therefore the usage of the term 
"representative" in the statement of the theorem. The lack of a unique representative for the null-space 
is responsible for the non-uniqueness of the rotation map that can effect a desired rotation. 

Before we get into the most general form of the scaling map, we illustrate a special case of it so as to 
provide insights into the construction. As before, let TZ = {Vj, i = 1, • • • , N} be a root codeset centered 
around Vi with a packing density 7. Let Vi = [vi • • • vj\/] where Vj is an A'^t x 1 vector and is the i-th 
column of Vi. Define the map s(-) by 



Vl V2 • • • VM-1 /3VM + S^M 



+1 



(15) 

where (3 = ^l-a^{l- (Vf V,Vf Vi)), 6 = a^l- (Vf V,Vf Vi), and vm+i is or- 
thogonal to Vi (that is, v^^^^Vi = Oixm)- We illustrate three properties satisfied by s(-) which ensures 
that it can scale submanifolds. Noting that Vj, i = 1, • • • , M + 1 are orthonormal vectors in C^* and that 
+ 5^ = 1, it is straightforward to check that s(Vj)^s(Vj) = 1m- For s(Vi), note that (3=1 and 
(5 = which results in s(Vi) = Vi. 

Proposition 2: We also have d(s(Vi), s(Vj)) = ad(Vi, Vj) for any i ^ 1. Thus, s(-) induces the 
scaling of TZ by a. 

Proof : Note that d(s(Vi), s{\i)) d{Vi,s{Yi)) = A^ax(ViVf - s{Vi)s{Vi)'') 

= Amax(vAfv|^ - (/3va/ + (5vAf+l ) (/3va/ + 5^r M ) (16) 

where in (a) we have used s(Vi) = Vi and (b) follows from ([TSl l. Using the trick of Lemma [U observe 
that the square of Amax in the above equation satisfies A^^^^ = 1 ~ Wm^^^m + ^^Ai+i)]"^ = 1 — (3"^ = 
a^(l — Amin(V(^VjV^Vi)). The proof is complete by noting the value of d(Vi, Vj) from Prop. [T] ■ 
The choice of vm+i is not unique and it is not clear whether the map in ([TS] ) is unique modulo the 
choice of vjvf+i. Furthermore, note that when (Nt — M) > M, s(V,;) can be written as 

s(Vi) = ViAi + Vr"Bi (17) 

where Aj = diag([l, • • • , 1, /?]) and Bj has only one non-zero entry which is at the (M, M)-th location 
and its value is 6. In Appendix |B] we resolve the uniqueness issue and construct the most general form 
of s(-). We also show that the most general form of s(Vj) is of the form in (fTTl) for a suitable choice 
of Aj and Bj. 
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A. Reduction to the Beamforming Construction of Lemma \3\ 

Corollary 1: In the special case of M = 1, the scaling map proposed in ([T5] ) (and extended in 
Theorem |2] of Appendix |B]) is a generalization of the map proposed in Lemma [3] (see ([T4b). 

Proof: For the sake of simplicity, we denote the map constructed in ( fTSl ) as Sgen(')- write Sgen(') 
as Sgen(vi) = ^/AiVi + ^/l — Aiv""" where Aj = 1 — a^(l — |vf^Vj|^) and v""" is an A^'t x 1 unit norm 
vector orthogonal to vi. We now draw a correspondence between Sbf(') and Sgen(')- 

In Lemma [3j note that UvertexU^^^g^ = 1^^ which impUes that v;j'-v^'^ = Iat.-i. Using the fact that 
U^rtexUvertex = ^Nt' similarly we obtain v^'^vf- = 1^^ — vivf^. Using this in (fT4l) . we have 

5bf(vi) = yi^^!^2^1^>f^e^'^<"'vi + a(v,-vi(vfv,)) (18) 

= v^e^'^<^-vi+a(v,-vi(vfv,)). (19) 

It is straightforward but surprising to note that YizXi^x^ both unit norm and orthogonal to vi. 

Vl-|vfvi|2 

Further, note that vT^-^ = ^Y^^^^^H^^f^^^'^- setting '^^'^j^g^ '^^ representative of v"^" in 

the general framework, we see that Sbf(-) can be obtained up to a phase term. And since we operate on 
the Grassmann manifold which is impervious to right multiplication by terms of the form e^^, we have 
proved the corollary. ■ 

B. Low-Complexity Generation of Local Components 

We now illustrate how the theory of rotation and scaling maps can be used to construct precoding 
codebooks with low-complexity. 

Root Codeset Generation: A root codeset that satisfies the localization and well-packing conditions as 
described above is constructed via VQ. The number of codewords in the root codeset is larger than 
^loc ensure that any local component has a cardinality smaller than that of the root codeset. 

Furthermore, since the scaUng map can only ensure that the output packing is more localized than the 
input packing, we need to pick 9 sufficiently large, but smaller than 1. The quantity j^s,x{Nt, M, N,9) 
corresponding to the choices of A^^, M, N and 9 is determined via Monte Carlo techniques and some 7 
is chosen in the interval (0, 7max(-^t, -M', A^, ^)). 

Local Components: For each member of the statistical component, we rotate the root codeset (via the 
rotation map of Theorem [1]) to the Nt x M matrix corresponding to the subspace of "St in the statistical 
component. Then, each rotated codeset is scaled by a shrinking factor Oi = That is, we scale each 
rotated codeset in proportion to the generalized eigenvalue of that subspace. From each rotated codeset 
of A^ codewords, we retain N^^^,i = 1, • • • , A'stat codewords. The heuristic behind the choice of A'l'^^ 
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has been explained in the previous section. The same heuristic can be used to justify the choice of as 
well. 

C. Exploiting the General Structure of the Scaling and Rotation Maps 

We now delve into why a general form of the maps in Appendix |B] is useful. In many practical 
systems, it is desired that the precoder codebook has more structure so as to ensure implementation ease. 
For example, two commonly desired properties are: 

1) Bounded Gain Power Amplifier Architecture where we require 

maxm„ |Vi(m,n)| 

max — ■ — — < r/. (20) 

WidC miiimn I Vj(m,n)| 

The above condition is useful in ensuring that the power amplifiers used in the radio link chain are not 
driven to their operational limits. The most general form of the rotation and scaling maps allows one to 
search for a codebook that satisfies the above property in addition to the localization and well-packing 
properties, and 

2) Recursive Codebook Structure where a codebook of rank-A'smaii can be generated from a codebook of 
rank-A'^iarge (with A'^iarge > -^smaii) by retaining only a subset of A/'smaii columns from every precoder in the 
rank-A'^iarge codebook. This property is desired so as to minimize the algorithmic complexity of generating 
a family of codebooks of different ranks on the fly. The low-complexity property of the proposed maps 
and the offline generation of the root codesets of different ranks ensure that this issue is redundant with 
our codebook design. 

Thus, we strongly generalize the maps of [40] and as a by-product observe that even in the M = 1 
case, a rich family of maps can effect the scaling operation other than ([141 ). Additional structure in the 
codebook can also be accommodated to ease implementation complexity. 

VI. Numerical Results 

We now illustrate via numerical studies the performance gains possible with our codebook construction 
and the consequent bridging of the gap between statistical and optimal precoding. In the first study, we 
consider a 4 x 4 channel under the separable model with = diag ([14.98 0.50 0.26 0.26]) and 
Ar = diag ([15.5 0.25 0.15 0.10]). This choice ensures that the transmit/receive covariance matrices 
are both ill-conditioned and with M = 2, note that the channel is not matched to the precoder. We first 
generate a root codeset of = 4 codewords with 9 w 0.76 and 7 « 0.75 via VQ. Let {uj} be the 
column vectors of Uj. The codebook used for S = 1 satisfies Astat = 1 with the codeword corresponding 
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Fig. 3. (a) Average mutual information with Gaussian inputs in a 4 x 4 mismatched channel following a separable model. Two 
data-streams are used in signaling and a limited feedback codebook designed along the principle elucidated in Sec. llVl is used, 
(b) Error probability performance with the same codebook under QPSK inputs. 



to [ui U2] and A'^rvq = 1 while with B = 2, the codebook has an additional RVQ codeword and a local 
codeword around [ui U2]. Similarly, with B = 4, iVstat = 3, N^^^ = Nf^^ = 3, Nf^^ = 2 and = 5. The 
statistical codewords correspond to [ui Uj], i = 2, • • • ,4. Since we are mainly interested in illustrating 
the performance gains in the high-SNR regime, uniform power allocation is used for Astat- 

Fig. Oa) shows the average mutual information with a Gaussian input for statistical and limited feedback 
precoding. In addition to the mutual information, raw bit error rate (BER) is useful as well. Fig|3lb) shows 
the improvement in error probability in the same channel with QPSK inputs. In the error probability case, 
the index of the codeword that minimizes the distance to the instantaneous Vh is fed back. Note that 
while the performance gap between the optimal and the statistical schemes is significantly bridged in 
the error probability case, further improvement in mutual information is possible. Nevertheless, both the 
figures show that substantial gains are possible with a few bits of feedback. For example, with B = A 
bits of feedback, a 3 dB gain is possible at a rate of 10 bps/Hz while a 6 dB gain is possible at a BER 
of 10"^. Also, note that an i.i.d. codebook design incurs a dramatic loss in performance in correlated 
channels. 

In the second study, we consider a 4 x 4 channel with non-separable correlation following the virtual 
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representation framework. The variance matrix a{i,j) = E [|Hjnci(i, used in the study is 
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Note that the channel has a single dominant transmit (as well as receive) eigen-mode and is hence 
mismatched when M = 3 data-streams are used in signaling. The parameters of the root codeset are 
= 4, 9 ^ 0.87 and 7 0.84. As before, let {uj} be the column vectors of the DFT matrix U^. The 
codebook for = 1 has the two statistical codewords [U3 U2 ui] and [U3 U2 U4]. For i? = 2, we use two 
additional RVQ codewords and for B = 4, we use iVstat = 3, A^,^^ = N^^^ = 3, iV,^^ = 2 and iV^vq = 5. 
The third statistical codeword when i? = 4 is [U3 ui U4]. Fig. |4] illustrates the bridging of the gap in 
mutual information between the optimal and the statistical schemes. It is important to note that both the 
channels studied here are so constructed to result in a substantial performance gap between perfect CSI 
and statistical signaling. In more realistic channels that are not so poorly matched, we expect an even 
better performance with our scheme. Thus our studies illustrate that substantial gains can be achieved 
even with few bits of feedback. 
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VII. Concluding Remarks 

In this work, we have studied linear preceding under a realistic system model. In particular, the focus 
is on the impact of spatial correlation when perfect CSI is available at the receiver, statistical information 
is available at both the ends, and quantized channel information is fed back from the receiver to the 
transmitter. While initial works on precoding assume perfect CSI at both the ends and hence do not 
impose any particular structure on the precoder matrices, under the model studied here, we see that 
structure can help in minimizing the reverse-link feedback as well as ease the implementation complexity. 

We introduced the notion of matched and mismatched channels and illustrated that limited feedback 
precoding is useful only in the case of mismatched channels. The study of statistical precoding motivates 
the proposed limited feedback design where we quantize the space of semiunitary matrices with a non- 
uniform bias towards the statistically dominant eigen-modes. The design as well as its adaptabihty are 
rendered practical by the construction of mathematical maps (operations) that can rotate and scale subman- 
ifolds on the Grassmann manifold. More importantly, numerical studies show that the proposed designs 
yield significant improvement in performance when the channel is mismatched to the communication 
scheme. 

This work is a first attempt at systematic precoder codebook design in single-user multi-antenna 
channels that exploits spatial correlation. Possible extensions are the study of more complex receiver 
architectures and performance analysis in the finite antenna, arbitrary SNR setting, along the lines of [29]. 
More work also needs to be done to understand the impact of spatial correlation on the performance of the 
proposed limited feedback scheme which could in turn drive the development of more efficient codebook 
constructions. Open issues that need further study include practical aspects hke codebook designs for 
wideband channels, codebook designs based on Fourier/Hadamard matrices that are useful in achieving 
the bounded gain power amplifier architecture and hence, have found much interest in the standardization 
community, incorporating the cost of statistics acquisition in performance analysis [41], and more general 
scattering environment-independent channel decompositions [42] that mimic the physical model closely. 
The case of multi-user systems with feedback, which has attracted significant recent interest, is another 
area for study. 

We close the paper by drawing attention to the philosophy that has guided this work. While deducing 
the structure of the optimal signaUng scheme under general assumptions on spatial correlation and channel 
information seems extremely difficult, an alternative approach that partitions this problem into smaller 
sub-problems could be quite fruitful. The general idea of matching the rank of the precoding scheme to the 
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number of dominant transmit eigenvalues with the resolution necessary to decide whether an eigenvalue 
is "dominant" or not being a function of the SNR reminds one of the classical source-channel matching 
paradigm [43]. Initial evidence seen in this paper also suggests that this partitioning provides a natural 
framework to understand the performance of limited feedback schemes. 

Appendix 

A. Proof of Theorem \J\ 

The efficacy of the rotation map is established if we can show the following: 

1) Gf Gi = Im for all i, 

2) Gi = Vtarget, and 

3) dproj,2(Gi,Gi) = rfproj,2(Vi, Vi) for all i. 

To prove 1), first note that Uvi and Uvt,,^^, are Nt x Nt unitary matrices. From the semiunitarity 
property of Vj, G^Gj = Im follows trivially. Using the unitary property of Uvi and the decomposition 
in the statement of the theorem, 2) also follows trivially. For 3), note that 

C^proj,2(Gi, Gj) = Amax (GiGf^ — GjGf^) 

= A^ax (Uv.,,..U^, (ViVf - V,Vf ) Uv,U^.,,^^.) = dproj,2(Vi, V,). (22) 

In the above chain of equalities, we have used the fact that A(AB) = A(BA) and the unitary property 
of Uvi and Uva^jet- Thus the proof is complete. ■ 

B. Generalized Scaling Map 

Theorem 2: Let 7^ be a root codeset with packing density 7 and center Vi. Let Ua and W be 
arbitrary M x M unitary matrices and let Ub be an arbitrary {Nt — M) x {Nt — M) unitary matrix. 
Given a £ (0, 1) and M < {Nt — M), for any Vj G TZ, generate an M x M diagonal, positive-definite 
matrix Aj with: Amin = miiij Aj(j) = I - {l - Amin(Vf V^Vf Vi)) and A^ax = max^ Ai(i) < 1. 
Then, define Aj as Aj = Ua A|^^ W^. Define the M x M principal component of the {Nt - M) x M 
diagonal matrix Ab as (Im — Aj) ' and Bj as Bj = Ub Ag W . 

If M > {Nt - M), for any Vj G TZ, generate an {Nt - M) x {Nt - M) diagonal, positive-semidefinite 
matrix Vi with: Tmax = maxj Vi{j) = a'^ {l - Amin(Vf VjVf Vi)) and Tmin = miiij Vi{j) > 0. Then, 
define Bj as Ub Ag ^ with the principal {Nt — M) x {Nt — M) component of Ab being Tj. Define 
Aj as Aj = Ua Aa with the principal {Nt — M) x {Nt — M) component of Aa being Iat^-m — Ti 
and the principal southeast component being l2M~Nf 
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Then, the scaUng map s(-) that leads to a packing density of 7a is given by 



s(V,) = ViA, + Vr"Bi (23) 

where V""" is a representative of the null-space corresponding to Vi. 

Proof: Let ru denote the rotation effected by a unitary matrix U. Since the scaling operation has to 
keep the center of a root codeset fixed, in the sequel, we use a fixed Nt x M matrix as the center instead 
of Vi which is dependent on the choice of IZ. This is achieved by a composition of three operations: 

s(-) = •^vertex TyjH^ (•). (24) 

Here, r^H (•) rotates the root codeset to the canonical precoder [Im OM>i{Nt-M)V while Svertex(-) scales 
(shrinks) the canonical codeset by a factor a and ruv^ rotates it back to the direction corresponding to 
Vi- From the above definition of s(-), we have 



'vertex 



vfv,. 



Vi V 



null 



ViA, + Vr"Bi 



In this 



where we have used a partitioning [Aj 'B>JY for the Nf x M matrix Syertex 

partitioning, Aj is M x M and is of full rank while Bj is an {Nt — M) x M matrix. 

Given that VfVi = Im, VfVJ"" = Omx(7V,-M) and v;'"'^V;"" = I^,_a/, the relationship 
Ai + B^^Bj = 1m ensures that s(Vj) is semiunitary. We show that Aj and Bj have to be as in the 
statement of the theorem so that the following properties of s(-) are met: 

1) rfproj,2(s(Vi),s(Vi)) = adproj,2(Vi, Vi) for all i, and 

2) s(Vi) = Vi. 

First, let us consider the distance scaling property. Assuming 2) (which we check subsequently) and 
following Prop. [T] we need 



A, 



(25) 



.(C) = A^ax (ViVf - s(VOs(V.)^) = a^l - A^i„(Vf ViVf Vi 

where C ^ ViVf - ViA^Af Vf - Vr"BiAf Vf - ViA.Bf Vj^"'^ - Vr"BiBf V;"""'^. In the 
expansion for C, we have used the relationship in (l23l) . We can decompose C as C2Cf where 

Vf 



Afvf 



BfVi 
B^V"""'^ 



Vi(lM-A,Af) -VJ^i'Bi -ViAi -Vi-"Bi 



(26) 
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Note that the non-trivial eigenvalues of AB are the same as those of BA. Hence, Ajnax(C) = Amax(Cf C2). 



Using the facts Vf Vi = Im, Vf V^""" 
4M X 4M matrix C2 is given by 



O 



Mx{Nt~M) 



and v"""'^V5'"" = In.^m, observe that the 



Cf C2 



Im - Ai Af 


Om 


-Ai 


Om 


Af(lM-AiAf) 


Om 


-AfA, 


Om 


Om 


-BfB, 


Om 


-BfB. 


Om 


-BfB, 


Om 


-BfB. 



(27) 



We will now show that the largest eigenvalue of C2 can be computed in closed-form. For this, 
we need to solve for A by setting det(C]^C2 — AI4M) = 0. Towards this computation, we need to use 
Lemma |2] following which, we have 

^ Im - A,Af - AIm Om -A, ^ 

det Af(lM-AiAf) -AIm -A,^A, -(28) 

Om -ABfB,(B^B, + AlM)-' -AI, 



det(Cf C2 - AI4M) 
det(-Bf Bi - AI 



M) 



V 



With K = det(-Bf B, - AIm) det(-AlM), upon another application of Lemma |2] we have 



det(Cf C2 - AI4M) 



det 



K 



Im — AjAj — AIm 
Af(I-A,Af) 



-AIm + Af AiBf Bi(Bf Bi + AIm 



(29) 



which can be simplified to 

det(Cf C2 - AI4M) 



= det(-Bf Bi - AIm) det(-AlM) det(-AlM + Af A^Bf Bi(Bf B^ + AIm)"') 
X det(-AlM - AAr^(-AlM + Af A,Bf B,(Bf B, + AlM)-')-'Af (Im - A^Af )). 
Note that det(C^C2 — AI4M) = has 4M solutions for A with the solution from the first two terms 
being non-positive. Setting the fourth term to zero, and using the facts that det(I + CD) = det(I + DC) 
and Im = A^Aj + B^Bj, we see that A has to satisfy: 

- 1 - Ai((-AlM + Af A,(Im - Af A,)(Im + AIm - Af A^r^'ilM - Af A,)) = 0. (30) 

After some straightforward simplifications, we can check that A is a solution to 

A,(A(Im - Af Ai)-i - Af Ai(lM + AIm - Af AJ-^) = L (31) 

Assume a singular value decomposition for Aj and Bj of the form: Aj = UaA^^W^ and Bj = 
UfiAg ^ Wg, respectively where Ua, Wa and Wb are M x M unitary matrices, and Ub is an {Nt — 
M) X {Nt — M) unitary matrix. The fuU-rankness of Aj means that the M x M diagonal matrix Aa 



H 
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is positive definite while the {Nt — M) x M matrix Ab has non-negative entries only along the leading 
diagonal. Since Af Aj + Bf = Im, we have 1m - Aa = Wf WB(AgAB)^/^Wf Wa- Comparing 
the two sides, we see that Wa = Wb (we set both to be W) and — Aa = (A^Ab)^/^. Note that 
since there are no constraints on/relationship between Ua and Ub, the leading diagonal entries of A a 
and Ab can be in any order. This is because either unitary matrix can be appropriately adjusted by a 
permutation matrix. 

Plugging in Af Ai = WAaW^ in (|3TI ). a routine computation yields M solutions to A of the 
form: = 1 — With the same form of A^Aj, by setting the third term to zero, we obtain 

another M solutions A = ■ ^Vl±^^^Mp/l±^y Note that Vl±^^^Mp/l±^ < i 

and hence, Amax(C) is obtained by setting i = M in the above solution which results in Amax(C) = 
-y/l — Amin(A^A). Using this in (l25l) . we get the expression for Amin(A,^Aj). Furthermore, Im = 
Af Aj + Bf Bj impUes that 

1 = A„,ax(Af Ai + Bf B,) > A„,ax(Af A,) + An,in(Bf B^) > {A„,ax(Af A^), A„,ax(Bf B,)}. (32) 

These are the constraints to be imposed on Aa to ensure that the scaling map preserves semiunitarity 
and reduces the minimum distance by a. 

If M < {Nt — M), without loss in generality assume that the diagonal entries of A a are in non- 
increasing order while those of Ab may be not. Given a choice of Aa, the condition Im — A a = 
(A^Ab)^/^ can be met by choosing the principal M x M component of Ab to be (Im — Aa)^''^- If 
M > {Nt — M), assume that the diagonal entries of Ab are in non-increasing order while those of Aa 
may be not. Then, the condition Im — Aa = (A^Ab)^^^ can be met if 2M — Nt entries of Aa are 1. 
The additional constraint on the smallest diagonal entry (see discussion above) ensures distance scaling. 

To close the theorem, it is necessary to verify that s(Vi) = Vi. This can be done by checking that Aj 
can be computed in closed-form. For this, note that Amin = 1 and since A^ax < 1, we have Aj = Im- 
From here, it can be checked that Bj = Oi^i^^_m)xM and from (|23] ). we thus have s(Vi) = Vi Ua W^. 
On the Grassmann manifold Q{Nt,M), multiplication by an M x M unitary matrix results in the same 
"point." Thus s(Vi) = Vi and the proof is complete. ■ 

Note that the choice of the scaling map is non-unique due to freedom in the choice of Ua, Ub and 
W as well as the eigenvalues of Aj and Fj. The case of Vj = Vi is special where Aj turns out to 
be Im- With almost any other choice of Vj, these matrices are non-identity, in general. Besides these 
choices, non-uniqueness of the representative of V""" also leads to non-uniqueness of the map. 
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